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_ Abstract 

In multiple-input multiple-output (MIMO) spatially multiplexing (SM) systems, achievable error 
rate performance is determined by signal detection strategy. The optimal maximum-likelihood detection 
c/5 ' (MLD) that exhaustively examines all symbol candidates has exponential complexity and may not 

be applicable in many practical systems. In this paper, we consider a generalized minimum mean 
square error (MMSE) detection derived from conditional mean estimation, which in principle behaves 

> ■ 

[- — ' equivalently to MLD but also includes a linear MMSE detection as a special case. Motivated by 



this fact, we propose a low-complexity detection which significantly reduces the number of examined 



symbol candidates without significant error rate performance degradation from MLD. Our approach is 

to approximate the probability density function (pdf) of modulated symbols that appears in the exact 

conditional mean expression such that the decision metric can be cast into a partially closed form. It 

is found that uniform ring approximation in combination with phase shift keying (PSK) and amplitude 
•<-h , 

, phase shift keying (APSK) is promising, as it can achieve a performance even comparable to MLD, 

S_j ■ 

. while its complexity is linear when the number of transmit antennas is two. 

Index Terms 

MIMO, spatial multiplexing, conditional mean estimation, MMSE, MAP detection 

I. Introduction 

Multiple-input multiple-output (MIMO) spatial multiplexing (SM) is a key air-interface tech- 
nology indispensable for implementing high data-rate wireless communications with limited 
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frequency resources. The enriched capacity of the MIMO-SM systems is simply a consequence 
of the fact that different signals are transmitted from plural antennas at the same time slot sharing 
the same frequency band. However, since these transmitted signals travel through the wireless 
channel where they are linearly superimposed, the receiver must properly unravel the mixture of 
the signals. Such an operation, variously called demultiplexing, signal detection, or estimation, 
almost entirely determines achievable performance in terms of error probability, and therefore it 
has been one of the most important issues in a multi-antenna system design. A number of signal 
detection frameworks, ranging from simple to sophisticated, have been hitherto proposed [1,2]. 

The MIMO signal detection in the simplest form is the linear detection (LD) typically designed 
based on the zero forcing (ZF) or the minimum mean square error (MMSE) criterion. The LD, 
though attractive for its simplicity, suffers from limited achievable diversity order which results 
in poor error rate performance especially when the number of receive antennas is small [1]. 
Successive interference cancellation (SIC) originated in multi-user code-division multiple access 
(CDMA) [1] is an extension of LD implementable with incremental complexity, where LD is 
iteratively performed to replicate and subtract interfering symbols. However, the performance 
improvement is not significant even at the cost of the delay due to the iterative process. Recently, 
it has been reported that, when preceded by a nonlinear operation called lattice reduction (LR), 
LD or SIC can achieve steady signal detection performance even with an ill-conditioned channel 
matrix [3-5]. The idea behind LR is to transform the lattice from which a signal constellation is 
drawn, such that the effective channel matrix becomes better-conditioned. The major drawbacks 
of the LR method include that 1) the complexity of an LR algorithm increases as the channel 
changes rapidly, and that 2) soft decision is difficult to be made since hard decision is essential 
on the transformed lattice. The former issue has been addressed with a low-complexity LR 
algorithm [6] and the latter can be partially overcome by list detection [7], but there remains 
another limitation that it is not applicable to constellations having no lattice structure such as a 
phase shift keying (PSK). 

While the studies on LD are mainly to assist it for an enhanced performance, those on the 
exhaustive search techniques such as maximum likelihood detection (MLD) and maximum a 
posteriori (MAP) detection are to reduce their immense search space. For example, in [8, 
9] reduced-complexity MLD techniques are presented, where a provisional estimate is first 
determined with LD and then only its vicinity is exhaustively searched. A similar but more 
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advanced approach is proposed in [10], where the likelihood maximization is done only with 
addition by exploiting the fact that a single bit flip in a square QAM constellation partially 
retains the former likelihood. Moreover, QR decomposition (QRD) aided schemes are extensively 
studied, where the channel matrix is decomposed by QRD so as to create a tree structure on 
the search space. With this, M-algorithm or sphere decoding (SD) can be employed for efficient 
search [11], and also several extensions have been proposed to curtail the search space further [12, 
13]. It also enables the sequential Monte-Carlo (SMC) technique that can efficiently calculate a 
posteriori probability of symbols [14]. A major drawback common to these QRD-aided schemes 
is that, due to the sequential structure, multiple symbols cannot be detected in parallel. 

In this paper, we focus on a generalized MMSE detection with no linearity constraint, which, 
to the best of our knowledge, has not been considered in the context of MIMO signal detection. 
It is literally a nonlinear but optimal processing in the sense that it indeed minimizes mean 
square error (MSE) by actually computing the exact conditional mean for given received sam- 
ples. A straightforward implementation of this generalized MMSE detection, however, involves 
exhaustive summation over all possible symbol patterns as in the case of MLD. Hence, there is 
no clear advantage over MLD in terms of processing overhead. In spite of this fact, we were 
inspired by this generalized structure since it has an interesting theoretical aspect from which we 
develop the idea of a completely new signal detection framework. Specifically, it is known [15] 
as well as shown in this paper that the generalized MMSE detection reduces to the conventional 
LD designed with the MMSE principle (i.e., linear MMSE), when all the modulated symbols 
are hypothetically assumed to be Gaussian distributed. Hence, even though as complicated as 
MLD in the general form, it also includes the least complicated scheme as a special case. This 
fact motivated us to derive a new detection scheme that balances performance and complexity, 
where the key idea is to approximate a selected one (rather than all) of the multiplexed symbols 
by a continuous random variable (RV) whose distribution should well capture a given shape of 
the discrete constellation. As a result, the exhaustive summation entailed in the generalized form 
of the MMSE detection is partially integrated out. In this manner, it is possible to reduce the 
complexity order from M Nt to M Nt ~ l , where M denotes modulation multiplicity and N t the 
number of transmit antennas. The benefit of this complexity reduction becomes apparent when 
N t is relatively small: in the case of N t = 2, the complexity reduces to a linear order of M, and 
hence the implementation has now become feasible with limited computational resource. 
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The rest of this paper is organized as follows: our system model and objective are stated in 
the next section. In Section III, some existing signal detection methods closely related to our 
approach are briefly reviewed, followed by the description of our new scheme in Section IV. 
Section V is devoted to numerical evaluation of error rate performance and Section VI concludes 
the paper. 

We use the following notations: for a vector or a matrix, T and H denote its transpose and 
Hermitian transpose, respectively. For a complex number, * denotes its conjugate. The entire set 
of real and complex numbers are denoted by R and C. An integral in the form of J £N d N x 
means an integration over iV-dimensional complex field, In denotes the identity matrix with the 



dimension of N, and | \x\ | = \/x u x is the norm of a given vector x. For RVs which are denoted 
in upper cases, their lower cases stand for realizations. The functions Px(x) and px(x) represent 
the probability and pdf of a given RV X, respectively. The same notational rule applies to the 
vector of RVs; i.e., Px( x ) an d Px( x ) denote the joint probability and joint pdf, respectively, 
for a vector of RVs X. The logarithm for complex numbers always takes the principal branch. 



II. System Model 

We consider a MIMO system with N t transmit and N r receive antennas (N t < N r ) as depicted 
in Fig. [U The input-output relationship of the channel is given by 



Y = HX + W. 



where 



X ■= 


X l ■ 


■ ■ X Nt 


Y := 


Y 1 ■■ 


■ Y Nr 


W : = 


Wi ■ 


■■ W N 


// : = 


h x ■ 


■ h Nt 



1 T 



h. 



hi 



h 



N r ,n 



Kn< N< 



(1) 

(2) 
(3) 
(4) 
(5) 



are a transmit symbol vector, a receive symbol vector, a noise vector, and a channel matrix, 
respectively. Assuming no beamforming before transmission, each transmit symbol X n , n = 
1, . . . , N t , is identically independently distributed (i.i.d.) and takes with an equal probability a 
value from the set of M signal points (i.e, constellation), Xu- Each element in the noise vector 
is also i.i.d. and follows the complex Gaussian distribution with zero mean and the variance N . 
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The task of the signal detector is, for a given event Y = y, to calculate the corresponding 



estimate of transmit vector X :- 



X, ■■■ X 



Ni 



where we assume that H, N , and Xu are 



a priori knowledge available upon calculating X at the receiver. 

III. Signal Detection Methods 

A. Linear Detection (LD) 

In LD, its detection operation is restricted to linear arithmetic, and hence X can be written 
in the form of 

X = Gy, (6) 

where the matrix G is adapted to H usually based on the ZF or MMSE principle. In the ZF 
principle, G corresponds to the pseudo-inverse of H: 

G ZF = {H H H) X H H , (7) 

which thus completely inverts the linear transformation made in the channel. However, since it 
adversely amplifies the noise term W at the same time, the following MMSE option is preferable 
in most cases. 

The detection matrix G that satisfies MMSE is by definition written as 

|2 



Gmmse = &rgmmE x ,w 

Cjr 



X - X 



(8) 



The use of the orthogonality principle is the simplest way to derive G M mse satisfying the 
definition above: 



:x-x)y h 



0, (9) 



Ex,w 

from which G M mse can be developed as 

Gmmse = E xyv [XY H ] (E XtW [YY 11 ])^ = H H (HH H + NqI^ 1 . (10) 
As shown in Appendix HI this has another form similar to CD) as 

Gmmse = (H H H + N^)' 1 H H . (11) 
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B. Maximum Likelihood and Maximum a Posteriori Detections 

In MAP detection, X is given as the one replica of possible transmit vectors that maximizes 
a posteriori probability P x \Y(x\y): 

X = arg max Px\y(x\y). (12) 

Taking logarithm of Px\y{x\y) and performing some manipulations on this formula, it follows 
that 

X = arg min \\y — Hx\\ 2 — N \ogPx(x). (13) 

Without a priori knowledge, all possible x should be assumed to be equally probable. In this 
case, the second term that depends on \ogPx(x) can be eliminated so that it reduces to the 
well-known form of MLD: 

X = arg min | \y — Hx\ | 2 . (14) 

C. Conditional Mean Estimation 

In Section [Tll-Al we saw that liner transformation of y by Gmmse minimizes MSE. However, 
with no linearity constraint, the linear MMSE is no longer optimal and the conditional mean 
estimation expressed as 

X = E x [X\Y = y] (15) 

is known to minimize MSE [15]. The formula above suggests simply that the optimal estimate 
should be the mean of the possible transmit vectors whose corresponding receive vector is y, 
and it is evident that MSE is minimized by this principle. 

Let us replace the expectation in (fl"5l) explicitly by summation and a posteriori probability 

Px\Y{x\y): 

X= xP X \Y(x\y). (16) 

xexg 

This conditional mean estimation expression can be interpreted as a soft-output MAP detection 
that weights all the possible patterns by their a posteriori probabilities, rather than finding the 
most probable one. Thus, if a hard decision is made on X, the performance is the same as 
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the MAP detection (also the same as MLD because of the i.i.d. assumption of X). In order to 
actually compute (fTBT) . the Bayes' theorem is invoked to rewrite the equation in terms of squared 
distance as 

X = e M , (17) 



A(«) := exp ( - llV A HXI ) . 



where 



N 

An example of this type of MMSE detection is found in [16, 17]. 

Upon deriving ([TBI from (fl"5T) . we have made use of the fact that each symbol X n , n = 
1, . . . , N t is a discrete RV having finite realizations. Let us temporarily presume that X n be a 
continuous RV having a pdf p Xn (x n ), and rewrite (fl"5T) as 



_ J £Nt xp x (x)X(x)d Nt x 
j C N t Px{x)\{x)d N ^x ' 



x = i x PX]Y (x\y)d*x = t 



From this formula consisting of multivariate integrals, the following interesting fact results: 

Proposition 1: If one substitutes a Gaussian pdf for px n (x n ) = ie - ' 2 ™' 2 (in vector notation, 
Px{x) = ^wT-e - " 3 '" 2 ), then the multivariate integrals in (fT9l) are completely solvable, and more 
importantly, the simplest form of the developed result coincides with X = G M mse?/- 

Proof: See Appendix HD ■ 
Hence, the conditional mean estimation with Gaussian approximation of symbols reduces to 
the linear MMSE [15]. One reason for significant performance gap between the linear MMSE 
and MLD stems from the fact that the former implies approximating the symbol distribution 
by Gaussian. Since the distribution of the transmitted symbols in general considerably deviates 
from Gaussian, this approximation may cause a significant discrepancy in the metric calculation. 

IV. A New MMSE Receiver Based on Symbol Distribution Approximation 

In the previous section we have shown that, if we keep the symbol distribution intact, then 
the conditional mean estimation performs as optimally as MLD; on the other hand, it is reduced 
to the linear MMSE if we approximate the pdf by Gaussian. Motivated by this fact, we explore 
another approximation that does neither significantly lose a property of original constellation 
shape, nor require any integration. Specifically, we seek a good approximation of pdf with 
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respect to a single variable in the multivariate integrals, so that the estimation X is cast into a 
partially closed-form. As a result, complexity order is reduced from M Nt to M^'" 1 . 

In what follows, we restrict our attention to the case with N t = 2 since in this case the 
complexity of MIMO detection reduces to linear order, which is most beneficial in practice. 
Note that the scenario with N t = 2 is considered to be practical in many MIMO systems, e.g., 
in the uplink from a small mobile terminal to a base station, where the terminal cannot possess 
more than two antennas due to size limitation. 



In the case of N t = 2, X and H are 2 x 1 and N r x 2, respectively: 

r i T 

X\ X2 1 



x 

H 



hi h z 



l N r ,n 



T 



n 



1,2. 



(20) 
(21) 



In order to detect X\ and X 2 separately in parallel as in Fig. [Q we rewrite (fTTT) element-wise as 

Y, Xl ex M T /X2 ex M x nPx 1 (xi)Px 2 (x 2 )\(x 1 ,x 2 ) _ 1Q 



X„ 



n 



where 



Ex lG * M E X2 ex M Px 1 (xi)P X2 (x 2 )X(x 1 ,x 2 

J2k=i \Vh ~ {hk,iX! + h k ix 2 )\ 2 



1,2, 



A(xi, x 2 ) := exp 



(23) 



It is sufficient to derive an estimation formula with respect only to X\\ that for X 2 is given by 
the same form, except that hi and h 2 are interchanged. Specifically, in element-wise notation, 
the channel coefficients are exchanged as 



hlA O /li 2) hoi h 2: 



h Nr ,i -H- h 



N r ,2- 



(24) 



A. Approximation of Desired or Interfering Symbols 

Upon calculating (|22)) . we assume that either X 1 or X 2 is a RV having continuous pdf f(x) : 
C — >■ R, whereas the other is left intact. If this assumption is made for X\, X 1 of (1221) is 
expressed as 

E X2€Xm a(x 2 ) 



Xi 



where 



a(x2) := / x\(x,x 2 )f(x)dx 
Jc 

f3(x 2 ) := / \(x,X2)f{x)dx. 
Jc 



(25) 

(26) 
(27) 
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In this example detector, X\ is the desired symbol that is to be estimated. We refer to this type 
of detection in which the pdf approximation is performed to the desired symbol as a Type-I 
detection. 

Similarly, when this approximation is applied to X 2 , we obtain 

i Ex.€*„ *lff(*l) ,,~ 

Al = • <28) 

where f3'(x) is equal to f3(x) defined in (1271) with the variable interchanges defined in (l24l) . Note 
that X 2 can be seen as an interfering symbol to the detector of X\, and we refer to this type of 
detection that applies a pdf approximation to the interfering symbols as a Type-II detection. 

The function f(x) should meet the following two requirements: 1) it should well approximate 
the original probability distribution of a signal constellation, and 2) it should transform a(x) 
and (3(x) into closed-form expressions. In what follows, we examine several forms of f(x) and 
derive the corresponding Type-I and II detection formulae, which are summarized in Table HI 
Note that each formula is given in the log domain, i.e., in a form of logXi rather than X\ itself, 
in order to avoid computational overflow. 

B. Preliminaries 

We here define several auxiliary variables such that the subsequent mathematical expressions 
become concise. In Type-I, we will use 



w:=y H h 2 = J2ylhk,2, (29) 
fc=i 

u:= ||/ii|| 2 = X;|/i M r, (30) 
fe=l 

v:= \\h 2 \\ 2 = J2\hk,2\ 2 i (31) 

fc=i 

N r 

z := (y- h 2 x 2 ) H h 1 = ^(yk ~ h kj2 x 2 )*h k>1 , (32) 

k=l 

r z := \z\, (33) 
(fi z = argz. (34) 
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In Type-II, these variables are interpreted as 

N r 

w := y H hi = Y^Vk h k,i, (35) 
fc=i 

N r 

II, I |9 

u 



^ 2 || 2 = ^IV 2 | 2 ; (36) 
fc=i 

v.= ||/h|| 2 = 5^|/imI 2 » (3 7 ) 

N r 

z:=(y- h 1 x 1 ) H h 2 = ^(Vk - fyfc,i£i)*fyfc,2, (38) 

k=l 

r z := \z\. (39) 

Note that 2 does not appear in the Type-II formulae. In addition, we denote by C a constant 
that appears only in the derivation stage. 

In the detection formulae expressed in the log domain, we make use of special summations 
denoted by ^ K,log and ^ c,log . These are Jacobian logarithm summations in the real and complex 
domains, respectively, in which the ordinary addition is replaced by other operations as follows: 

K,log 

J^: a + b ->■ max(a,6) + log(l + e~ |a ~ fe| ), a,beR (40) 

C.log 

J^: a + 6^max Rc (a,&)+log(l + e- (maXRc(a ' 6) ^ minRc(a ' b)) ), a, b G C, (41) 

where maxR e (a, b) denotes the one whose real part is greater than the other, and minR C (a, b) is 
defined accordingly. The derivations of these summations are provided in Appendix Unl In the 
above expressions, neglecting the second terms with log and exp operations results in significant 
reduction of computational complexity at the cost of some performance degradation. In this 
simplified case, summation operations reduce to the max and maxR e operations. 

C. Gaussian Approximation 

To start with, we examine the Gaussian pdf, i.e., 

/(x)=exp(-|a;| 2 ). (42) 

Although this does not well approximate ordinary signal constellations such as QAM and PSK, 
with this the integrals are guaranteed to become a simple closed-form as demonstrated in deriving 
the linear MMSE from the conditional mean estimation in Section IIII-CI 
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The substitution of d42j) into d26j) and d27J) yields 

ot{x2) = / x\(x, X2) exp(— |x| 2 )<±c 



/2Re[u>x 2 ] - t>|a; 2 | 2 + \z\ 2 /(u + N )\ z* 
= Cex V at , AT ■ (43) 

/3(x 2 ) = / A(x, x 2 ) exp(— |x| 2 )c?x 



2ReK 2 ]-,|x 2 | 2 + NV(, + iV ) (44) 



1) Type-I: Substituting (|43T) and (|44|) into (1251) . we obtain the Type-I detection formula as 



z 



' 2Re [111x2] -v\ x 2 1 2 + 1 z 1 2 /(u+Nq ) " 

X x = (n + iVo)- 1 - "^'"" S S— ■ (45) 

V- OVT , f 2Re[ W x 2 ]-^|x 2 | 2 +| Z | 2 /(«+jVo) X 



AT 



In the log domain, we obtain 

, <>■ , / ,n v 2 ? /2ReK 2 ]-^|x 2 | 2 + |z| 2 /(n + iV0) . 
logXi = -log( M + iVo)+ > ^ ^ ' J - + hgr z -jt 

x 2 ex M V ^0 
^ S 2ReK 2 ]-^|x 2 | 2 + |z| 2 /(n + iVo) 
~ ^ ^ ' 



2) Type-II: Similarly, from (1281) . we obtain the Type-II detection formula as 

\- OVT ^ ( 2-Rc[wx 2 ]-v\x 2 \ 2 +\z\ 2 /{u+N ) 

2-^xiex M ex P ^ 



X 1 = — — ^ * ^ (47) 

f 2Rc[ W X2]-»|x2| 2 +|^| 2 /(n+^o) N 
iVo 



Its log domain form is given by 

logX 1= ^ ( ^ ] L_J_ — LLL1 ^ + io g | Xl | + Jarg 



12 1 UI2 



^ 2ReK 1 ]- W |x 1 | 2 +|z| 2 /(n + iVo) 
" ^ iVo • (48) 

Z). Uniform Square Approximation for QAM 

In this subsection, we consider a uniform square approximation which is suitable for rect- 
angular M-QAM, where M is an even power of 2. In this approximation, infinite number of 
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symbol points are uniformly distributed in the square region D K with the vertices at n + jn, 
—k + jn, — k — jn, k — jn. By taking the limit of M — > oo in the constellation of M-QAM: 

l)+2k 



M 



M-l) + 2n - VM 
+ 3- 



v/2(M-l)/3 a/2(M - X )/ 3 



< n, k < VM 



(49) 



we find that k = a/3/2. Using the unit step function, denoted by U(-), f(x) is expressed as 

f( x ) = — l — {U(Re[x} + k) - U(Re[x] - k)\ {U(1m[x\ + «) - C/(Imb] - re)} . (50) 
(2«) 



The substitution of <|50J) into d26j) and (ED) yields 
1 



a(x 2 ) 



(2k) 



x\(x, x 2 )dx 



_ /2Re[u>x 2 ] — v \x 2 \ 2 + \z\ 2 /u 
CeXP ( iVo 



N ( _ (+)2 

e 1 — e 

M7T 



Re 



it 



/iVo / _(+)2 



e Q 





V" 


) + Im 









(51) 



where 



1 



(2*) 2 
C exp 



A(x, x 2 )dx 



.(±) - 




w x 2 


2 + 


2 2 /M 


iVo 








V" 




^Re 




± re J 




it 





eie Q , 



(52) 



.(±) 




ei := erf ( r, 



.(+) 



erf ( Ti 



.(-) 



i 



ij Type-/: In (|5JJ), 



_ T (+) 2 -t(-> 2 

e i — e i 



iV 
erf (r<; 





~z*~ 


^Im 






u 


+ ')- 


erf 




(-)2\ 




Q 



± K 



(53) 
(54) 

are found to be neg- 



r (-) 

Q 



ligible even when signal-to-noise ratio (SNR) is small. Substituting (IBTI) and (|52|) into (1251) with 
dropping the negligible terms, we obtain 



x*ex„ ex P ( aL e i e Q 



Since e\ and eQ are positive, it can be expressed in the log domain as follows: 

C ' lo S /onJ..._ 1 12 , I 1 2 



(55) 



log(X 1 



2Re[wx 2 ] — w|x 2 | + \z\ ju 



o 



log u+ ^2 

K,l0g / _ r i i I O 119/ 

/ 2Re[wx 2 J — u|X2| + \z\ ju 



+ logei + loge Q + logr 2 



E 

X2&XM 



No 



+ log ei + log e c 



(56) 
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2) Type-II: Substituting (1521) into (1281) . we obtain the Type-II detection formula as 

Ef 2Re\wx2\— v\x2\ 2 + \ z \ 2 / u \ 
V p Yn ( 2 R e [™X2]-v\x2\ 2 + \z\ 2 /u \ p „ 



Its log domain form is given by 

/2Re[wxi] 

E log / 2Re[w;xi] — 
l - *- ^ ^^ + lo gei + loge Q . (58) 



v^ g f^Rekuxt] - vlxA 2 + \z\ 2 lu . . . , . 

logAi = I — hlogei + loge Q + log|xi| +jargxi 



ELloe 



£. Uniform Ring Approximation for PSK/APSK 

In this subsection, we consider a uniform ring approximation suitable for M-APSK whose 
constellation is in general expressed in terms of the number of rings K, the radii of the rings 
Pi, ... , p K , and the number of symbol points on each ring M k , k = 1, . . . , K, where 2~2k=i -^k = 
M. With these parameters, the constellation is defined as 

Xm = {p k e j2lTn/M \ < n < M k , k = 1, . . . , K) . (59) 

Note that M-PSK corresponds to the case with K — 1, p\ = 1, M\ = M. In the uniform ring 
approximation, infinite number of symbol points are uniformly distributed on each ring. Thus, 

1 - 

k=l 

where S(-) is the Dirac's delta function. 

With f(x) given as (1601) , it is easier to develop ([261) and (1271) in polar coordinates by replacing 
x with x = pk^P 6 ■ It follows that 



a(x 2 ) := 



71 k=i ^~ 7r 

A f 2Re[wx 2 ] - up 2 - v\x 2 \ 2 \ _ j4> (2p k r z \ 
C g exp ^ j p k e >*.I X j , (61) 



PM := 2^K ' ' ' X O>ke? $ ,X2)d0 



fc=i ^ ^ 

£7 £ exp ^W-^-N^ /o ^ (62) 
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where Iq(-) and ii(-) are respectively the zeroth and first order modified Bessel functions of the 
first kind. These Bessel functions can be well approximated as Io(x) ~ exp(x) and h(x) ~ 
exp(x). 

1) Type-I: Substituting (|6TT) and (l62l) into (1281) with each Bessel function replaced by its 
exponential approximation form, we obtain 

S^ K „ c -j<t>z ™ ( 2(Re[wx 2 ]+p k r :z )-upl-v\x2\ 2 \ 
2^x 2 eX M 2^k=l Pk e ex P I No ) 

v = _ L- (63) 

1 sr^K ( 2{Rc[wx 2 }+ p k r z )-upl~v\x 2 \ 2 N 

l^x 2 £X M 2^k=l ex P ^ 



No 



whose log domain form is 



kg*,- E E 

x?eX M k=l,...,K 



log C,log /r>/T-» r 1 \ 9 I 19 1 

/ 2(Re[wx 2 J + Pfc^) - upl - v\x 2 \ z + logp fc 



Ni 



o 



K,log IK, log n/T-. r l \ 9 i i9 

2(Re[wx 2 \+ p k r z ) -up z k -v\x 2 \ z .... 
^ N • (64) 

x 2 &X M k=l,...,K u 

As mentioned above, a PSK constellation corresponds to the one with K = 1 and p x = 1, which 
simplifies the detection formula as 

C,log /n/n r l \ \ K.log 



(65) 



. « ^ 8 ^ 2(Re[wx 2 ]+r,) . , \ ^ 2(Re[wx 2 ]+r 2 

xaeAfjf x ' x 2 £X M 

2) Type-II: The substitution of (|62|) into (1281) yields the Type-II formula as 

_ ST K ov ™ f 2 (R- c [ W3: - , i]+Pfcr z )-Mp|- t )|3;i| 2 N \ 

OYT , ( 2(Re[wa; 1 ]+p fc r-)-up|-^|xi| 2 \ 
Z-fxieATjif <Efc=l ex P \ N J 

In the log domain, 

'2(Re[u>£i] + p k r z ) - up\ - w|xi| 2 + log \x x \ 



log*,- E E 



iVn 



J argxi 



Ev- 2(Re[u;xi] + p k r z ) - up\ - v\xi\ 2 
^ N. ' <67) 



xi€X M k=l,...,K 

For the PSK constellations, we obtain 

logXx = 2^ ( Tr +jargxij- ^ 



2(Ke[wxi] + r z 



N 



(68) 
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V. Simulation Results 

In this section, we evaluate the bit error rate (BER) performance of the proposed signal 
detection method via computer simulation. The fading coefficients hj. >n , k = 1, . . . , N r , n = 1,2 
are assumed to be mutually-independent complex Gaussian RVs each with zero mean and unit 
variance. The SNR per information bit is denoted by E^/Nq. 

Figures US and @ contain the plot of BER curves of QPSK, 8-PSK, and 16-PSK detected 
based on the uniform ring approximation, where it can be seen that the performances of both 
Type-I and II gradually deviate from that of the linear MMSE, and the Type-I detection always 
outperforms the Type-II detection. At BER = 1CT 4 , the required SNR is improved by 11 or 
12dB over the linear MMSE in the case of Type-I, even approaching the performance of MLD 
within 3 dB. In these figures, the performances of the Type-I and II detections, where the Jacobian 
logarithm summations are replaced by the max and maxR e operations (see Section IIV-BI) . are 
also plotted (labeled with "max" in the legend). The performance gap between these approaches 
is as small as 0.5 dB in the cases of QPSK and 8-PSK with Type-I detection, and even becomes 
negligible in the other cases. We thus remark that this simplification of the Jacobian summations 
is a preferable option in practice, as it significantly reduces complexity. 

In Fig. [51 we make a comparison of BER performances with increasing N r from 2 to 4, for 8- 
PSK detected by the Type-I uniform ring approximation. As observed, increasing N r remarkably 
reduces the SNR gaps from MLD: there is a loss of as small as 0.8 dB in N r = 3, and there is 
no noticeable loss in N r = 4. 

In Fig. [6l we plot the BER performance of a 16-APSK constellation with the parameters of 
K — 2, px — a/2/5, p 2 = 2 a/2/5, and Mi = M 2 = 8, which corresponds to a popular setting 
of the 16-APSK consisting of two concentric 8-PSK with the ring ratio of 1 : 2. We can observe 
in the BER curves a tendency similar to that of the PSKs. The performance gap between Type-I 
and MLD is, however, slightly larger than that in the PSK cases. 

The detection based on the uniform square approximation is evaluated in Fig. [7] with N r — 2 
and 16-QAM as an example. Similar to the PSK/APSK cases, Type-I clearly outperforms Type- 
II, but their BER curves show that achievable diversity order is not as large as that in PSK/APSK. 
As a result, compared with the linear MMSE, the SNR improvement at BER = 1CT 4 is limited 
to 3.5 dB even in Type-I. 
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Unlike the uniform square and ring approximations, the Gaussian one does not target a specific 
constellation. We therefore examine its applicability to both 8-PSK and 16-QAM. Figure [8] 
compares the Type-I detection of 8-PSK with both Gaussian and uniform ring approximation. 
Noticeable gap is observed, suggesting the considerable dependence of the symbol distribution 
approximation on the overall performance. Shown in Fig. [9] is the case for 16-QAM, Type-I, 
with both Gaussian and uniform square approximation. In the latter case, the gap becomes small, 
but the square approximation still outperforms Gaussian. 

Lastly, as a simple demonstration of low-complexity aspect of the proposed detection frame- 
work, in Table [II] we compare average processing time for detecting 1,000,000 symbols in our 
simulation program written in C++. Here, we focus only on the PSKs with the uniform ring 
approximation which has a BER performance close to that of MLD. As a reference, in the table 
the number of multiplications 3 for iteration and the number of iterations required for detecting 
one symbol are also listed. As observed, the processing time of the Type-I and II detections are 
significantly lower than that of MLD. 

VI. Conclusion 

In this paper, we have proposed a low-complexity detection scheme for MIMO-SM systems, 
based on the generalized MMSE detection derived from the conditional mean estimation expres- 
sion. Our main focus is on the case with two transmit antennas, but the proposed scheme can 
be extended to the general system having any number of transmit antennas in a straightforward 
manner. The reduction of complexity is due to the approximation of the probability distribution 
in the conditional mean estimation expression such that the resulting formula in part has a 
simple closed form. The proposed detection has two sub-classes: in Type-I the desired symbol is 
approximated, whereas the approximation is in turn applied to the interfering symbol in Type-II. 
By simulation, we have observed that the Type-I detection is superior to Type-II. Furthermore, in 
the case of PSK and APSK, the former detection with the uniform ring approximation achieves 

'Note that the number of real multiplications is displayed. A complex multiplication accounts for 4 real multiplications, but 
it is reduced to 2 when followed by Re. An absolute square of a complex number takes 2 real multiplications. For MLD, the 
multiplication count is half the actual one since it can detect Xi and Xi simultaneously. For the Type-I detection, we note that 
computational burden is also on decomposing a complex number into its phase and amplitude, whereas for Type-II, the phase 
is not necessary and thus its processing time is lower than that of Type-I. 
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a BER performance even comparable to MLD. Finally, as a possible future work, we note 
that there may be a further room of complexity reduction and BER improvement by utilizing 
approximations other than those examined in this paper. 

Appendix I 

Equivalence of Two Expressions of Linear MMSE 
A proof of the equivalence between (flOl) and (fTTj) proceeds as follows: 

(H H H + N Q I Nt y X H H 

J_I _ J_h h ( —HH H + I N ) 1 — H\ 

No Nt N \N Nr J N 

1 //" _ ± H " ( —////" + I N \ 1 HH H 



-_± H H ( -^HH H + I A 1 (^HH H + I N \-^H H (^HH H + I N \ —HH» 



* H { No {k HHH+I *)} lj * 

H H (HH H + iV //v r )" 1 • (69) 

Appendix II 
Proof of Proposition Q] 

Substituting p x (x) = ^-e"" 3 '" 2 into (fT9l) , we obtain 

/ c ^.exp( J^y iWP )^ 



X 



JcNt exp d * 



r / -~y H Hx~(y H Hx)*+x H H H Hx+Nox H x \ ,]\f t 

JC N t X ' eX P \ TVo / a X 



(70) 



r / — y H Hx— (y H Hx)*+x H H H Hx+Nnx H x \ jm. _ 

Jciv, exp — 2 a g J 

Let Q be the unitary matrix that diagonalizes the Hermitian matrix H H H as 

QH H HQ H = A, (71) 
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where A is a diagonal matrix that gathers the eigenvalues of H H H, X i7 1 < i < N t . With the 
unitary matrix Q, we translate the problem into finding Z := QX instead of X. Changing the 
variable x to Q H x in d70l) then yields Z as 

Z = — A- — — : ^ —J- • (72) 



r (-yH HQH X -(yH H QH x) * +x H( A+NoI)x) } ^ 

j £Nt exp y Wq j a 

.H 1 1 /-'til 



X 



Let Oj denote the z-th element in the row vector y HQ . The i-th element in Z, denoted by 
Zi, is then written as 

A _ / c x ■ exp rfx n ^ /c exp (- —n-^^)-- ) <fc 



n£i / c ex p (■ 



-a n x~a^x*+(X n +No)xx* \ i 
N ' aX 



Noticing that 



-a n x - a*x* + (A n + N )xx* . 
exp | ~ I ax 



exp 



exp 



An + iVp 

U |2 



.r 



(A„ + No) 



dx 



7lN 



and that 



No{\ n + No)J \ n + No 

—diX - a*x* + (Aj + N )xx* 



x ■ exp 



exp 



N 

ttN a* 



dx 



No(K + N )J Xi + NoXi + No 
731) can be simplified as we obtain 

a* 



Z, 



Xi + N n 



Thus, 



Ai+7Vo 



_X Nt +N 



(A + NoI Nt y\y H HQ H ) H = (A + N I Nt )- L QH H y. (77) 



i/^ t/ff. 



(73) 



(74) 



(75) 



(76) 
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Finally we invert the transformation Z = QX to obtain 

X = Q H Z = Q H (A + NvI Nt Y l QH H y = Q H (QH H HQ H + N Q I Nt y l QH H y 

= Q H (Q(H H H + N I Nt )Q H )~ 1 QH H y = Q H Q H (H H H + N I Nt y l Q^QH H y 
= (H H H + N I Nt )- 1 H H y, (78) 

S v ' 

=Gmmse 

is precisely the linear MMSE detection form of © with (fTTI) . 

Appendix III 
Jacobian Logarithm 

Let A and B be real- valued and greater than zero, and suppose a := log A and b := log B. 
The Jacobian logarithm appears as a core formula in the log domain implementation of the MAP 
decoding algorithm [18], with which one can write log(v4 + B) in terms of a and b as 

\og(A + B)= log (A(l + B/A)) =a + log(l + B/A) = a + log(l + e b /e a ) 

= a + log(l + e-( a - fe )), (79) 

or as 

\og(A + B) = 6 + log(l + e- (b - a) ). (80) 

Notice that in (|79l) the exponential e~^ b ^ < 1 for the case when a — b > 0, whereas in (f8~0l) 
e -{b-a) < J for a — 6 < 0. Hence, (1791 and ( f80l ) are guaranteed not to overflow within the 
corresponding ranges of a — b. Collecting the equations and conditions in a compacted form, we 
have 

\og(A + B)= max(a, b) + log (l + e" 1 ^ 1 ) (81) 

which never overflows for any values of a and b. 

The Jacobian logarithm in the real domain presented above is widely known. To extend it 
to the complex domain, we observe that the equations (1791) and (l8~0l) are also valid for a and b 
being complex numbers. The conditions for satisfying |e~ < - a ~ b )| < 1 and |e~^~ a ^| < 1 are given 
by Re [a] — Re [b] > and Re [a] — Re [b] < 0, respectively, from which the complex domain 
Jacobian logarithm follows as 

\og(A + B) = max Rc (a, b) + log (l + e -(™x Ro (a,b)-min Rc (a,6))) _ (g2) 
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MIMO channel 



Fig. 1. The proposed MIMO System model. The transmitter with Nt = 2 is depicted as an example. 



TABLE I 

List of the derived detection formulae. 



Shape of f(x) 


Gaussian 


1 

uniform square 


uniform ring 


Target constellation 


unspecified 


QAM 


APSK 


PSK 


type-I formula 


{46} 


{56} 


m 


mi 


type-II formula 




m 


{67} 


{68} 



TABLE II 

Average time (sec) for processing 1,000,000 symbols in our simulation program written in C++ (N r = 2). 
The values in parentheses are without the simplification of Iacobian summation. 





Iteration 


Multiplication per iter. 


QPSK 


8-PSK 


16-PSK 


Linear MMSE 




4iVr- 


0.57 


0.58 


0.54 


MLD 


M 2 


5N r 


1.32 


2.26 


5.52 


uniform ring type-I 


M 


12N r + 3 


0.90 (1.65) 


1.37 (2.95) 


2.27 (5.88) 


uniform ring type-II 


0.71 (1.35) 


1.10 (2.55) 


1.82 (5.03) 
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— • — Linear MMSE : 



MLD 

type-l 




E b /N dB 



Fig. 2. BER curves of QPSK detected by the Type-I and II formulae with the uniform ring approximation. The number of 
receive antennas is N r — 2. 
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Fig. 3. BER curves of 8-PSK detected by the Type-I and II formulae with the uniform ring approximation. iV r = 2. 
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■ Linear MMSE 

■ MLD 
type-l (max) 




15 20 25 

E b /N dB 



40 



Fig. 5. BER comparison across N r = 2, 3, and 4 for 8-PSK with the Type-I uniform ring approximation. 
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Fig. 6. BER curves of 16-PSK detected by the Type-I and II formulae with the uniform ring approximation. N r = 2. 
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Fig. 7. BER curves of 16-QAM detected by the Type-I and II formulae with the uniform square approximation. N r = 
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Fig. 8. BER comparison between the Gaussian and uniform ring approximations for 8-PSK. N r = 2. 
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Fig. 9. BER comparison between the Gaussian and uniform square approximations for 16-QAM. N r = 2. 
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